Studies of protein designability using reduced models
نویسندگان
چکیده
One the most important problems in computational structural biology is protein designability, that is, why protein sequences are not random strings of amino acids but instead show regular patterns that encode protein structures. Many previous studies that have attempted to solve the problem have relied upon reduced models of proteins. In particular, the 2D square and the 3D cubic lattices together with reduced amino acid alphabets have been examined extensively and have lead to interesting results that shed some light on evolutionary relationship among proteins. Here, additionally to the 2D square lattice, we study the 2D triangular and 3D face centered cubic (fcc) lattices, we perform designability studies using different shapes embedded in the 2D square lattice, and we use machine learning algorithms to classify binary sequences folding to highlyor poorly-designable conformations. In the first part of the thesis we extend the transfer matrix method to the 2D triangular lattice. The transfer matrix method is a highly efficient method of enumerating all conformations within a compact lattice area that has earlier been developed for the 2D square and 3D cubic lattices. In addition we also enumerated all compact conformations within simple geometries on the 2D triangular and 3D face centered cubic (fcc) lattices using a standard backtracking algorithm. In the second part of the thesis we described protein designability studies on various shapes in the 2D square lattice using a reduced hydrophobic-polar (HP) amino acid alphabet. We used a simple energy function that counted the number of H-H, H-P and P-P interactions within a restricted set of protein shapes that have the same number of residues and nonbonded contacts. We found a difference in the designabilities of different protein shapes. Finally, in the third part of the thesis we used standard machine learning algorithms to classify two classes of protein sequences. We first performed a designability study for two shapes, using a binary HP alphabet, on the 2D triangular lattice and separated highlyand poorly-designable conformations. Highly-designable conformations had many sequences folding to them with the lowest energy and poorly-designable conformations had few or no sequences folding to them. Sequences were classified as highlyor poorly-designable
منابع مشابه
Surveying determinants of protein structure designability across different energy models and amino-acid alphabets: A consensus
A variety of analytical and computational models have been proposed to answer the question of why some protein structures are more ‘‘designable’’ ~i.e., have more sequences folding into them! than others. One class of analytical and statistical-mechanical models has approached the designability problem from a thermodynamic viewpoint. These models highlighted specific structural features importa...
متن کاملThe Designability of Protein Structures: A Lattice-Model Study using the Miyazawa-Jernigan Matrix
We study the designability of all compact 3×3×3 and 6×6 lattice-protein structures using the Miyazawa-Jernigan (MJ) matrix. The designability of a structure is the number of sequences that design the structure, i.e. sequences that have that structure as their unique lowest-energy state. Previous studies of hydrophobic-polar (HP) models showed a wide distribution of structure designabilities. Re...
متن کاملDesignability of protein structures: a lattice-model study using the Miyazawa-Jernigan matrix.
We study the designability of all compact 3 x 3 x 3 and 6 x 6 lattice-protein structures using the Miyazawa-Jernigan (MJ) matrix. The designability of a structure is the number of sequences that design the structure, i.e., sequences that have that structure as their unique lowest-energy state. Previous studies of hydrophobic-polar (HP) models showed a wide distribution of structure designabilit...
متن کاملAn Analytical Approach to the Protein Designability Problem
We present an analytical method for determining the designability of protein structures. We apply our method to the case of two-dimensional lattice structures, and give a systematic solution for the spectrum of any structure. Using this spectrum, the designability of a structure can be estimated. We outline a heirarchy of structures, from most to least designable, and show that this heirarchy d...
متن کاملThe designability of protein structures.
It has been noted that natural proteins adapt only a limited number of folds. Several researchers have investigated why and how nature has selected this small number of folds. Using simple models of protein folding, we demonstrate systematically that there is a "designability principle" behind nature's selection of protein folds. The designability of a structure (fold) is measured by the number...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015